Joint MCE estimation of VQ and HMM parameters for Gaussian mixture selection
Abstract
Vector quantization (VQ) has been explored in the past as a means of reducing likelihood computation in speech recognizers that use hidden Markov models (HMMs) with Gaussian output densities. Although this approach has proved successful, there is a point beyond which further reduction in likelihood computation substantially degrades recognition accuracy. Since the components of the VQ frontend are typically designed after model training is complete, this degradation can be attributed to the fact that the VQ and HMM parameters are not jointly estimated. To restore the accuracy of a recognizer that uses VQ to reduce computation aggressively, joint estimation is necessary. In this paper, we propose a technique that couples VQ frontend design with Minimum Classification Error (MCE) training. We demonstrate on a large-vocabulary subword task that, in certain cases, our joint training algorithm can reduce the string error rate by 79% compared to that of VQ mixture selection alone.
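As a rough illustration of the VQ mixture selection that the abstract takes as its baseline, the following minimal sketch quantizes a feature frame to its nearest codeword and evaluates only the Gaussians on that codeword's shortlist. It is not the paper's joint MCE training procedure. All function names, the toy codebook, the shortlists, and the likelihood floor are hypothetical, and diagonal covariances are assumed for simplicity.

```python
import math

def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at x."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )

def nearest_codeword(x, codebook):
    """Index of the codeword closest to x in squared Euclidean distance."""
    return min(
        range(len(codebook)),
        key=lambda k: sum((xi - ci) ** 2 for xi, ci in zip(x, codebook[k])),
    )

def state_log_likelihood(x, weights, means, variances, shortlist=None, floor=-1e10):
    """Log mixture likelihood of x for one HMM state.

    If a shortlist is given, only those mixture components are evaluated;
    an empty shortlist returns the (assumed) floor value.
    """
    indices = range(len(weights)) if shortlist is None else shortlist
    terms = [
        math.log(weights[i]) + log_gauss_diag(x, means[i], variances[i])
        for i in indices
    ]
    if not terms:
        return floor
    top = max(terms)  # log-sum-exp for numerical stability
    return top + math.log(sum(math.exp(t - top) for t in terms))

# Toy setup (hypothetical values): 2 codewords, one state with 4 Gaussians.
codebook = [[0.0, 0.0], [5.0, 5.0]]
means = [[0.0, 0.0], [0.5, 0.5], [5.0, 5.0], [5.5, 4.5]]
variances = [[1.0, 1.0] for _ in means]
weights = [0.25, 0.25, 0.25, 0.25]
# Hand-picked shortlists: components assumed relevant near each codeword.
shortlists = {0: [0, 1], 1: [2, 3]}

frame = [0.2, -0.1]
k = nearest_codeword(frame, codebook)
full = state_log_likelihood(frame, weights, means, variances)
fast = state_log_likelihood(frame, weights, means, variances, shortlists[k])
print(f"codeword={k} full={full:.6f} shortlisted={fast:.6f}")
```

In this toy case the shortlisted likelihood uses half the Gaussian evaluations and differs negligibly from the full one. Shrinking the shortlists further is where the accuracy loss described in the abstract would be expected to appear.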
Similar resources
A continuous density interpretation of discrete HMM systems and MMI-neural networks
The subject of this paper is the integration of the traditional vector quantizer (VQ) and discrete hidden Markov models (HMM) combination in the mixture emission density framework commonly used in automatic speech recognition (ASR). It is shown that the probability density of a system that consists of a VQ and a discrete classifier can be interpreted as a special case of a semicontinuous mixtur...
Parameter clustering and sharing in variable-parameter HMMs for noise robust speech recognition
Recently we proposed a cubic-spline-based variable-parameter hidden Markov model (CS-VPHMM) whose mean and variance parameters vary according to some cubic spline functions of additional environment-dependent parameters. We have shown good properties of the CS-VPHMM and demonstrated on the Aurora-3 corpus that MCE-trained CS-VPHMM greatly outperforms the MCE-trained conventional HMM at the cost o...
Online Bayesian tree-structured transformation of HMMs with optimal model selection for speaker adaptation
This paper presents a new recursive Bayesian learning approach for transformation parameter estimation in speaker adaptation. Our goal is to incrementally transform or adapt a set of hidden Markov model (HMM) parameters for a new speaker and gain large performance improvement from a small amount of adaptation data. By constructing a clustering tree of HMM Gaussian mixture components, the linear...
A New Method Used in HMM for Modeling Frame Correlation
In this paper we present a novel method to incorporate temporal correlation into a speech recognition system based on the conventional hidden Markov model (HMM). In our new model, the probability of the current observation depends not only on the current state but also on the previous state and the previous observation. The joint conditional PD is approximated by non-linear estimation method...
Phoneme recognition system based on HMM with distributed VQ codebook
In this paper a new variant of HMM, named distributed VQ HMM (DVQHMM), is presented. Its main characteristic is the use of codebooks distributed over the HMM states, with a new manner of HMM parameter estimation. Procedures for training and HMM evaluation of each recognition unit are described. Comparative results on an isolated phoneme recognition system are shown, between DVQHMM and conventional V...